Ref 
# 


Hits 


Search Query 


DBS 


Default 
Operator 


Plurals 


Time Stamp 


SI 


2198 


newsgroup or Usenet 


US-PGPUB; 
USPAT; 

UbULK, 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/05 11:38 


S2 


314 


(weighted adj graph) or (spectral 
adj clustering) 


US-PGPUB; 

USPAT; 

1 icr>PD- 
UbULK, 

EPO; JPO; 

DERWENT; 

IBM.TDB 


OR 


ON 


2006/01/04 15:51 


S3 


3 


SI and S2 


US-PGPUB; 
USPAT; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/04 15:19 


S5 


2 


SI with S2 


US-PGPUB; 
USPAT; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/04 15:19 


S6 


99308 


newsgroup or Usenet or forum or 
(special adj interest adj group) or 
sig 


US-PGPUB; 

USPAT; 

1 \cr\r*D • 
UbULK, 

EPO; JPO; 

DERWENT; 

IBM.TDB 


OR 


ON 


2006/01/04 15:21 


S7 


8 


S6 and S2 


US-PGPUB; 

USPAT; 

UbOLK, 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/06 14:23 


S8 


2 


S6 same S2 


US-PGPUB; 
USPAT; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/04 15:22 


S9 


2 


S6 and (cross-post$3 or 
crosspost$3) and graph 


US-PGPUB; 
USPAT; 

UbULK; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/04 15:44 


SIO 


1 


"5796393".PN. 


USPAT; 
USOCR 


OR 


OFF 


2006/01/04 15:45 



Search History 1/6/2006 2:40:25 PM Page 1 
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Sll 


1 


"6215495".PN. 


USPAT; 
USOCR 


OR 


OFF 


2006/01/04 15:46 


S12 


6 


("5796393" | "6215495" | "6266805" 
1 "6289299" | "6295514").PN. OR 
("6594673").URPN. 


US-PGPUB; 

USPAT; 

USOCR 


OR 


OFF 


2006/01/04 15:46 


S13 


280 


(weighted adj graph) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/04 15:51 


S14 


280 


(weighted adj graph) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/05 11:37 


S15 


37 


S14 and "707".clas. 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/04 15:54 


S16 


2198 


newsgroup or Usenet 


US-PGPUB; 

USPAT; 

UbOCR, 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/04 16:23 


S17 


314 


(weighted adj graph) or (spectral 
adj clustering) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/06 14:32 


S18 


2 


S16 same S17 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBI^_TDB 


OR 


ON 


2006/01/04 17:16 


S19 


5 


S16 and (aoss-post$3 or cross adj 
post$3 or aosspost$3) 


US-PGPUB; 
USPAT; 

UbOLR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/04 17:17 
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S20 


2206 


newsgroup or usenet 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/05 11:42 


S21 


3 


S20 and (weighted adj graph) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM.TDB 


OR 


ON 


2006/01/05 11:37 


S22 


4 


(newsgroup or usenet) and 
aoss-post$3 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM.TDB 


OR 


ON 


2006/01/05 11:38 


S23 


284 


weighted adj graph 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/05 11:43 


S24 


60 


spectral adj duster$3 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/05 11:44 


S25 


180685 


(newsgroup$l board adj message$l 
bbs bulletin) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM.TDB 


OR 


OFF 


2006/01/06 14:06 


S26 


959163 


(graph$l chart$l) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:03 


S27 


24380 


S25 and S26 


US-PGPUB; 
USPAT; 
USOCR; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2006/01/06 14:04 
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S28 


m 


525 same 526 


U5-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:04 


S29 


5915188 


(generat$3 creat$3) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:05 


S30 


959163 


(graph$l chart$l) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM.TDB 


OR 


OFF 


2006/01/06 14:04 


S31 


3146670 


(cluster$3 group$3) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBMJTDB 


OR 


OFF 


2006/01/06 14:05 


S32 


6584716 


(genetcit$3 creat$3 build$3) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:06 


S33 


192723 


(newsgroup$l board adj message$l 
bbs bulletin fomm) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:06 


S34 


14483 


533 and S31 and 532 and 530 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:08 


S35 


3 


533 with 531 with 532 with 530 


US-PGPUB; 
USPAT; 
USOCR; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2006/01/06 14:16 
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S36 


26 


S33 same S31 same S32 same S30 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:10 


S37 


2 


"5923846".pn. 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:18 


S38 


2 


"6336132".pn. 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:18 


S39 


1 


"6952700".pn. 


USPAT 


OR 


OFF 


2006/01/06 14:31 


S40 


164662 


"707" .das. "709".clas. "345".clas. 
"711".clas. 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM.TDB 


OR 


ON 


2006/01/06 14:35 


S41 


318 


(weighted adj graph) or (spectral 
adj clustering) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/06 14:35 


S42 


85 


S41 and S40 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/06 14:35 


S43 


959163 


(graph$l chart$l) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBMJTDB 


OR 


OFF 


2006/01/06 14:35 


S44 


3146670 


(cluster$3 group$3) 


US-PGPUB; 
USPAT; 
USOCR; 
EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


OFF 


2006/01/06 14:35 
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S45 


6584716 


(generat$3 creat$3 build$3) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:35 


S46 


192723 


(newsgroup$l board adj message$l 
bbs bulletin forum) 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:35 


S47 


14483 


S46 and S44 and S45 and S43 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:35 


S48 


2462 


S47 and S40 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


ON 


2006/01/06 14:37 


S49 


26 


S46 same S44 same S45 same S43 


US-PGPUB; 

USPAT; 

USOCR; 

EPO; JPO; 

DERWENT; 

IBM_TDB 


OR 


OFF 


2006/01/06 14:37 


S50 


13 


S49 and S40 


US-PGPUB; 
USPAT; 

UbULK, 

EPO; JPO; 
DERWENT; 
IBM TDB 


OR 


ON 


2006/01/06 14:38 
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Google ^ 



Scholar w beta 




Lowercase "or" was ignored. Try "OR" to search for either of two terms, [ details ] 
The "AND" operator is unnecessary - we include all search terms by default, [ details] 



Scholar 



Results 1 - 4 of 4 for ( newsgroup or usenet ) and (" weighted graph "). (0.05 seconds) 



Tip: Try removing quotes from your search to get more results. 
Exploring the community structure of newsgrou ps 

C Borgs, J Chayes, M Mahdian, A Saber! - Proceedings of the 2004 ACM SIGKDD international conference 2004 - 
portal.acm.org 

... the cross-post graph as a weighted graph with vertices ... http://research. microsoft. 
com/jchayes/Papers/usenet.html. ... to help users find the right newsgroup to post ... 

Cited by 2 - Web Sea rch - research.mi crosoft.c om - research.microspft.com - pprtaj., acm.org 

An Intelligent A g ent for Hi g h-Precision Text Filterin g 
A O'Riordan, H Sorensen - CIKM, 1995 - portal.acm.org 

... A weighted graph repre- sentation is used for documents, and graph manipula- 
tion algorithms are used in the processing. 1 Introduction ... 
Cited by_8 - VVeb Search - cds_ery4.jnria.fr - portal. acm.prg 

Analyzing the Effectiveness and Applicability of Co-training 

K Nigam, R Ghani - CIKM, 2000 - portal.acm.org 

... Thus, the word \career" from the rst newsgroup is a distinct feature from the ... When 

tokenizing this data, the UseNet headers (includ- ing the subject line) are ... 

Cited by 109 - Web Search - cs.wjjstl^du - kamalniqam.com - w ww- 2.cs.cmu.edu - alL1 3,yersions.» 

On Scaling Up Balanced Clustering Algorithms 

A Banerjee, J Ghosh - SDM, 2002 - lans.ece.utexas.edu 

... a graph partitioning problem [22]. A weighted graph is constructed whose 

vertices are the data-points. An edge connecting any two ... 

Cited by 9 - View as HTML - Web Search - lans.e ce. utexa s.edu 




Google Home - About Google - About Google Scholar 

©2005 Google 



fe PORTAL 



Subscribe (Full Service) Register (Limited Service. Free) Log in 



USPTO 



Search: € The ACM Digital Library C The Guide 
[ (new s group or Usenet) and (weighted and graph) 



Ternns used nevysgroup or usenet and wei ghted and graph 



Sort results | relevance ^ ^ Save results to a Binder 

by I -J pji 

i Search JjiDS 
^dedform n Open results i 
window 



Display 
results 



Feed back Report a problem Satis factio n 
surv ey 

Found 14,608 of 169,166 

Try an Advanced Search 

Try this search in Th e ACM Guide 



in a new 



Results 1 - 20 of 200 
Best 200 shown 



Result page: 123456Z89 



10 next 

Relevance scale □□Hi 



1 Industry/government track posters: Exploring the community structure of news g rou ps 
Christian Borgs, Jennifer Chayes, Mohammad Mahdian, Amin Saberi 
August 2004 Proceedings of the tenth ACM SIGKDD international conference on 

Knowledge discovery and data mining KDD '04 
Publisher: ACM Press 

Full text available: ^ pdf ( 3.11 MB) Additional infornnation: full citation , abstract , references , index terms 

We propose to use the community structure of Usenet for organizing and retrieving the 
information stored in newsgroups. In particular, we study the network formed by cross- 
posts, messages that are posted to two or more newsgroups simultaneously. We present 
what is, to our knowledge, by far the most detailed data that has been collected on 
Usenet cross-postings. We analyze this network to show that it is a small-world network 
with significant clustering. We also present a spectral algorithm which ... 



Keywords: clustering, spectral method, usenet 



Data mining: Minin g news g roups using networks arisin g from social behavior 
Rakesh Agrawal, Sridhar Rajagopalan, Ramakrishnan Srikant, Yirong Xu 
May 2003 Proceedings of the 12th international conference on World Wide Web 
Publisher: ACM Press 

Full text available* I?) pdf(299 31 KB) Additional Information: full citation , abstract , references , citin gs, index 

Recent advances in information retrieval over hyperlinked corpora have convincingly 
demonstrated that links carry less noisy Information than text. We investigate the 
feasibility of applying link-based methods in new applications domains. The specific 
application we consider is to partition authors into opposite camps within a given topic in 
the context of newsgroups. A typical newsgroup posting consists of one or more quoted 
lines from another posting followed by the opinion of the author. This ... 

Keywords: data mining, link analysis, newsgroup, social network, text mining, web 
mining 



3 Clusterin g : Bipartite gra ph partitionin g and da ta clus terin g 
^ Hongyuan Zha, Xiaofeng He, Chris Ding, Horst Simon, Ming Gu 

^ October 2001 Proceedings of the tenth international conference on Information and 
knowledge management 

Publisher: ACM Press 

Full text available- 151 pdfd 45 MB) Additional Information: full citation , abstract , references, citings, index 
. |A| *^^-^ terms 

Many data types arising from data mining applications can be modeled as bipartite 
graphs, examples include terms and documents in a text corpus, customers and 
purchasing items in market basket analysis and reviewers and movies in a movie 



recommender system. In this paper, we propose a new data clustering method based on 
partitioning the underlying bipartite graph. The partition is constructed by minimizing a 
normalized sum of edge weights between unmatched pairs of vertices of the ... 

Keywords: bipartite graph, correspondence analysis, document clustering, graph 
partitioning, singular value decomposition, spectral relaxation 



GroupLens: an open architecture for collaborative filtering of netnews 
Paul Resnick, Neophytos lacovou, Mitesh Suchak, Peter Bergstrom, John Riedl 
October 1994 Proceedings of the 1994 ACM conference on Computer supported 

cooperative woric 
Publisher: ACM Press 

Full text available* fiU pdfd 32 MB) Additional Information: full citation , abstract , references , citing s, index 

terms 

Collaborative filters help people make choices based on the opinions of other people. 
GroupLens is a system for collaborative filtering of netnews, to help people find articles 
they will like in the huge stream of available articles. News reader clients display 
predicted scores and make it easy for users to rate articles after they read them. Rating 
servers, called Better Bit Bureaus, gather and disseminate the ratings. The rating servers 
predict scores based on the heuristic that people wh ... 

Keywords: Usenet, collaborative filtering, electronic bulletin boards, information filtering, 
netnews, selective dissemination of information, social filtering, user model 



Fast detection of communication patterns in distributed executions 
Thomas Kunz, MIchiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced 

Studies on Collaborative research 
Publisher: IBM Press 

Full text available: ^ Ddf(4.21 MB) Additional Information: full citation , abstract , references , index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the 
execution of the application. The visualization tool we use is Poet, an event tracer 
developed at the University of Waterloo. However, these diagrams are often very complex 
and do not provide the user with the desired overview of the application. In our 
experience, such tools display repeated occurrences of non-trivial commun ... 

The SIFT infornnation dissemination system 
Tak W. Yan, Hector Garcia-Molina 

December 1999 ACM Transactions on Database Systems (TODS), volume 24 issue 4 
Publisher: ACM Press 

Full text available* ^'l Ddf(220 77 KB) A^^**'^"^^ Information: full citation , abstract , references , citjngs, index 
• terms 

Information dissemination Is a powerful mechanism for finding information in wide-area 
environments. An information dissemination server accepts long-term user queries, 
collects new documents from information sources, matches the documents against the 
queries, and continuously updates the users with relevant information. This paper is a 
retrospective of the Stanford Information Filtering Service (SIFT), a system that as of 
April 1996 was processing over 40,000 worldwide subscriptions and ov ... 

Keywords: Boolean queries, dissemination, filtering, indexing, vector space queries 



Data minin g : A matrix density based al g orithnn to hierarchically co-cluster documents Q 
and words 

Bhushan MandhanI, Sachindra Joshi, Krishna Kummamuru 

May 2003 Proceedings of the 12th international conference on World Wide Web 
Publisher: ACM Press 



Full text available: ^ Ddfd 33.06 KB) Additional Information: full citation , abstract , reference s, citings, index 

terms 

This paper proposes an algorithm to hierarchically cluster documents. Each cluster is 
actually a cluster of documents and an associated cluster of words, thus a document-word 
co-cluster. Note that, the vector model for documents creates the document-word matrix, 
of which every co-cluster is a submatrix. One would intuitively expect a submatrix made 
up of high values to be a good document cluster, with the corresponding word cluster 
containing its most distinctive features. Our algorithm looks to ... 

8 O ptimistic re plicatio n 
Yasushi Saito, Marc Shapiro 

March 2005 ACM Computing Surveys (CSUR), volume 37 issue i 
Publisher: ACM Press 

Full text available: ^ pdf(656.72 KB ) Additional Information: full citation , abstra ct, references , index terms 

Data replication is a key technology In distributed systems that enables higher availability 
and performance. This article surveys optimistic replication algorithms. They allow replica 
contents to diverge in the short term to support concurrent work practices and tolerate 
failures in low-quality communication links. The importance of such techniques is 
increasing as collaboration through wide-area and mobile networks becomes 
popular.Optimistic replication deploys algorithms not seen in tradition ... 

Keywords: Replication, disconnected operation, distributed systems, large scale systems, 
optimistic techniques 



9 An intelligent a g ent for hi gh- precision text filterin g 
Adrian O'Riordan, Humphrey Sorensen 

December 1995 Proceedings of the fourth international conference on Information 

and linowledge management 
Publisher: ACM Press 

Full text available: ^ pdf(636.79 KB) Additional Information: full citation , references , citin gs, index term s 



0 Research session 5: data minin g / transaction mana g ement: A divide-and-mer ge 
methodology for clusterin g 

David Cheng, Santosh Vempala, Ravi Kannan, Grant Wang 

June 2005 Proceedings of the twenty-fourth ACM SIGI^OD-SIGACT-SIGART 
symposium on Principles of database systems 

Publisher: ACM Press 

Full text available: ^ pclf ( 791 .76 KB ) Additional Information: full citation , ab stract , references 

We present a divide-and-merge methodology for clustering a set of objects that combines 
a top-down "divide" phase with a bottom-up "merge" phase. In contrast, previous 
algorithms either use top-down or bottom-up methods to construct a hierarchical 
clustering or produce a flat clustering using local search (e.g., /c-means). Our divide phase 
produces a tree whose leaves are the elements of the set. For this phase, we use an 
efficient spectral algorithm. The merge phase quickly finds an optim ... 



1 Customizing information capture and access 
Daniela Rus, Devika Subramanian 

January 1997 ACM Transactions on Information Systems (TOIS), volume i5 issue i 
Publisher: ACM Press 

Full text available- pdf{1 26 MB) Additional Information: full citation , abstract , reference s, citings, index 
. [A| = terms, revieyy 

This article presents a customizable architecture for software agents that capture and 
access information in large, heterogeneous, distributed electronic repositories. The key 
idea is to exploit underlying structure at various levels of granularity to build high-level 
indices with task-specific interpretations. Information agents construct such indices and 
are configured as a network of reusable modules called structure detectors and 
segmenters. We illustrate our archltectu ... 



Keywords: information gathering, software agents, table recognition 



12 Communities: Flash forunns and forumReader: navi g atin g a new kind of large-scale 
^ online discussion 

^ Kushal Dave, Martin Wattenberg, Michael Muller 

November 2004 Proceedings of the 2004 ACM conference on Computer supported 

cooperative woric 
Publisher: ACM Press 

Full text available: ^ pdf(513.95 KB) Additional Information: full citation , abstract , references , index ter ms 

We describe a popular kind of large, topic-centered, transient discussion, which we term a 
<i>flash forum</i>. These occur in settings ranging from web-based bulletin boards to 
corporate intranets, and they display a conversational style distinct from Usenet and other 
online discussion. Notably, authorship is more diffuse, and threads are less deep and 
distinct. To help orient users and guide them to areas of interest within flash forums, we 
designed ForumReader, a tool combining data ... 

Keywords: collaboration, large-scale conversations, mass interaction, persistent 
conversations, prototype, thumbnail interface, user interface, user study, visualization 



13 Ex periments in social data minin g : The TopicShop s ystem 

Brian Amento, Loren Terveen, Will Hill, Deborah Hix, Robert Schulman 
^ March 2003 ACM Transactions on Computer*Human Interaction (TOCHI), volume lo issue 
1 

Publisher: ACM Press 

Full text available- 151 Ddf(377 92 KB) Additional Information: full citation , abstra ct, refer ences , citin gs, iadex 
terms 

Social data mining systems enable people to share opinions and benefit from each other's 
experience. They do this by mining and redistributing information from computational 
records of social activity such as Usenet messages, system usage history, citations, or 
hyperlinks. Some general questions for evaluating such systems are: (1) is the extracted 
information valuable? and (2) do interfaces based on the information improve user task 
performance? We report here on TopicShop, a syst ... 

Keywords: Cocitation analysis, collaborative filtering, computer-supported cooperative 
work, information visualization, social filtering, social network analysis 



14 Constructin g, or g anizin g, and visualizin g collections of topically related Web 
resources 

Loren Terveen, Will Hill, Brian Amento 

March 1999 ACM Transactions on Computer-Human Interaction (TOCHI), volume 6 issue 
1 

Publisher: ACM Press 

Full text available* IS Ddf(303 62 KB). Information: full citation , abstract , references , citings. Index 

' ^ ' terms 

For many purposes, the Web page is too small a unit of interaction and analysis. Web 
sites are structured multimedia documents consisting of many pages, and users often are 
Interested In obtaining and evaluating entire collections of topically related sites. Once 
such a collection is obtained, users face the challenge of exploring, comprehending and 
organizing the items. We report four innovations that address these user needs; (1) we 
replaced the Web page with the Web site 

Keywords: cocitation analysis, collaborative filtering, computer supported cooperative 
work, information visualization, social filtering, social network analysis 



Research track papers: A probabilistic framework for semi-supervised clustering 
^ Sugato Basu, Mikhail Bilenko, Raymond J. Mooney 

August 2004 Proceedings of the tenth ACM SIGKDD international conference on 



Knowledge discovery and data mining KDD '04 

Publisher: ACM Press 

Full text available: ^ pdf{187.51 KB) Additional Information: full citation , abstract , references , index terms 

Unsupervised clustering can be significantly improved using supervision in the form of 
pairwise constraints, i.e., pairs of instances labeled as belonging to same or different 
clusters. In recent years, a number of algorithms have been proposed for enhancing 
clustering quality by employing such supervision. Such methods use the constraints to 
either modify the objective function, or to learn the distance measure. We propose a 
probabilistic model for semi-supervised clustering based on Hidden Mar ... 

Keywords: distance metric learning, hidden Markov random fields, semi-supervised 
clustering 



16 Recommender systems and social computing: Recommending collaboration with 
^ social networks: a comparative evaluation 
^ David W. McDonald 

April 2003 Proceedings of the SIGCHI conference on Human factors in computing 
systems 

Publisher: ACM Press 

Full text available* 13 Ddf(489 61 KB) Additional Information: full citation , abstract , references , citin gs, index 
'•^■^ terms 

Studies of information seeking and workplace collaboration often find that social 
relationships are a strong factor in determining who collaborates with whom. Social 
networks provide one means of visualizing existing and potential interaction in 
organizational settings. Groupware designers are using social networks to make systems 
more sensitive to social situations and guide users toward effective collaborations. Yet, 
the implications of embedding social networks in systems have not been system ... 
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The explosive growth of the world-wide-web and the emergence of e-commerce has led to 
the development of recommender systems— a personalized Information filtering 
technology used to identify a set of items that will be of interest to a certain user. User- 
based collaborative filtering is the most successful technology for building recommender 
systems to date and is extensively used in many commercial recommender systems. 
Unfortunately, the computational complexity of these methods grows I ... 
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Weblogs and message boards provide online forums for discussion that record the voice of 
the public. Woven into this mass of discussion is a wide range of opinion and commentary 
about consumer products. This presents an opportunity for companies to understand and 
respond to the consumer by analyzing this unsolicited feedback. Given the volume, format 




and content of the data, the appropriate approach to understand this data is to use large- 
scale web and text data mining technologies.This paper ar ... 
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We propose to use the community structure of Usenet for organizing and retrieving the 
information stored in newsgroups. In particular, we study the network formed by cross- 
posts, messages that are posted to two or more newsgroups simultaneously. We present 
what is, to our knowledge, by far the most detailed data that has been collected on 
Usenet cross-postings. We analyze this network to show that it is a small-world network 
with significant clustering. We also present a spectral algorithm which ... 
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In this paper we describe an evaluation of behavioral descriptors generated from an 
analysis of a large collection of Usenet newsgroup messages. The metrics describe 
aspects of newsgroup authors' behavior over time; such information can aid in filtering, 
sorting, and recommending content from public discussion spaces like newsgroups. To 
assess the value of a variety of these behavioral descriptors, we compared 22 
participants' subjective evaluations of authors whose messages they read to behavio ... 
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UCNS is pleased to announce the availability of a new Usenet News server on campus, 
news.uga.edu. Usenet News is an electronic public forum in which you can participate In 
discussions and exchange information with millions of people around the world. This 
article will explain what Usenet News is, how it works, and how you can take advantage of 
this tremendous resource. 
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The Netscan project helps online participants form cooperative relationships by offering a 
better sense of the other players involved. 
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Collaborative filters help people make choices based on the opinions of other people. 
GroupLens is a system for collaborative filtering of netnews, to help people find articles 
they will like in the huge stream of available articles. News reader clients display 
predicted scores and make it easy for users to rate articles after they read them. Rating 
servers, called Better Bit Bureaus, gather and disseminate the ratings. The rating servers 
predict scores based on the heuristic that people wh ... 

Keywords: Usenet, collaborative filtering, electronic bulletin boards, information filtering, 
netnews, selective dissemination of information, social filtering, user model 



GroupLens: a pplying collaborative filterin g to Usenet news 

Joseph A. Konstan, Bradley N. Miller, David Maltz, Jonathan L. Herlocker, Lee R. Gordon, 
John Riedl 

March 1997 Communications of the ACM, volume 40 issue 3 
Publisher: ACM Press 

Full text available: ^ pdf(343.16 KB) Additional Information: full citation , references , citing s, index terms 



Building task-specific interfaces to hi g h volume conversational data 

Loren G. Terveen, William C. Hill, Brian Amento, David McDonald, Josh Creter 

March 1997 Proceedings of the SIGCHI conference on Human factors in computing 

systems 
Publisher: ACM Press 

Full text available: '^Ddf (908.00 KB ) Additional Information: full citation , references , citing s, index term s 



Keywords: Netnews, Usenet, World Wide Web, collaborative filtering, computer- 
supported cooperative work, data mining, human interface, human-computer interaction, 
organlzatinal computing, resource discovery, social filtering 



9 Text Ex traction and Su mnnarization: Text cl assifi cation in a hierar ch i ca I jriixt ure 
model for small trainin g sets 

Kristlna Toutanova, Francine Chen, Kris Popat, Thomas Hofmann 
October 2001 Proceedings of the tenth international conference on Information and 
icnowiedge management 

Publisher: ACM Press 

Full text available* 151 Pdf(1 40 MB) Additional Information: ful l citation , abstract , r eference s, dtings, index 

. |Aj ^ - terms 

Documents are commonly categorized into hierarchies of topics, such as the ones 
maintained by Yahoo! and the Open Directory project, In order to facilitate browsing and 
other interactive forms of information retrieval. In addition, topic hierarchies can be 
utilized to overcome the sparseness problem in text categorization with a large number of 
categories, which is the main focus of this paper. This paper presents a hierarchical 
mixture model which extends the standard naive Bayes classif ... 
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l< Is the most important parameter in a text categorization system based on the /c-nearest 
neighbor algorithm (/cNN). To classify a new document, the /c-nearest documents in the 
training set are determined first. The prediction of categories for this document can then 
be made according to the category distribution among the k nearest neighbors. Generally 
speaking, the class distribution in a training set Is not even; some classes may have more 
samples than others. ... 
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We present a divide-and-merge nnethodology for clustering a set of objects that combines 
a top-down "divide" phase with a bottom-up "merge" phase. In contrast, previous 
algorithms either use top-down or bottom-up methods to construct a hierarchical 
clustering or produce a flat clustering using local search (e.g., /f-means). Our divide phase 
produces a tree whose leaves are the elements of the set. For this phase, we use an 
efficient spectral algorithm. The merge phase quickly finds an optim ... 
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One of the most challenging problems facing builders and facilitators of community 
networks is to create and sustain social engagement among members. In this paper, we 
investigate the drivers of social engagement in a community network through the analysis 
of three data sources: activity logs, a member survey, and the content analysis of the 
conversation archives. We describe three important ways to encourage and support social 
engagement in online communities: through system design elements sue ... 
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Two-dimensional contingency or co-occurrence tables arise frequently in important 
applications such as text, web-log and market-basket data analysis. A basic problem in 
contingency table analysis Is co-clustering: simultaneous clustering of the rows and 
columns. A novel theoretical formulation views the contingency table as an empirical joint 
probability distribution of two discrete random variables and poses the co-clustering 
problem as an optimization problem in information theory 
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Dyadic data nnatrices, such as co-occurrence matrix, rating matrix, and proximity matrix, 
arise frequently in various important applications. A fundamental problem in dyadic data 
analysis is to find the hidden block structure of the data matrix. In this paper, we present 
a new co-clustering framework, block value decomposition(BVD), for dyadic data, which 
factorlzes the dyadic data matrix Into three components, the row-coefficient matrix R, the 
block value matrix B, and the column-c ... 
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While the vast majority of clustering algorithms are partitional, many real world datasets 
have inherently overlapping clusters. Several approaches to finding overlapping clusters 
have come from work on analysis of biological datasets. In this paper, we interpret an 
overlapping clustering model proposed by Segal et al. [23] as a generalization of 
Gaussian mixture models, and we extend it to an overlapping clustering model based on 
mixtures of any regular exponential family distribution and the c ... 
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The number, size, and user population of bibliographic and full-text document databases 
are rapidly growing. With a high document arrival rate, it becomes essential for users of 
such databases to have access to the very latest documents; yet the high document 
arrival rate also makes it difficult for users to keep themselves updated. It is desirable to 
allow users to submit profiles, I.e., queries that are constantly evaluated, so that they will 
be automatically informed of new additions tha ... 



Results 1-20 of 40 Result page: 12 3 next 

The ACM Portal is published by the Association for Computing Machinery. Copyright © 2006 ACM, Inc. 
Terms of Usa ge Privacy Policy C ode of Ethics Contact U s 



Useful downloads: § Ado be Acro bat Q QuickTime B Windows Media Pla yer ^> R eal Play er 



